In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or space; this definition differs from the term's usage in statistics, which refers to a set of such values.
A sampler is a subsystem or operation that extracts samples from a continuous signal. A theoretical ideal sampler produces samples equivalent to the instantaneous value of the continuous signal at the desired points.
The original signal can be reconstructed from a sequence of samples, up to the Nyquist limit, by passing the sequence of samples through a reconstruction filter.
For functions that vary with time, let be a continuous function (or "signal") to be sampled, and let sampling be performed by measuring the value of the continuous function every seconds, which is called the sampling interval or sampling period. Then the sampled function is given by the sequence:
Reconstructing a continuous function from samples is done by interpolation algorithms. The Whittaker–Shannon interpolation formula is mathematically equivalent to an ideal low-pass filter whose input is a sequence of Dirac delta functions that are modulated (multiplied) by the sample values. When the time interval between adjacent samples is a constant , the sequence of delta functions is called a Dirac comb. Mathematically, the modulated Dirac comb is equivalent to the product of the comb function with . That mathematical abstraction is sometimes referred to as impulse sampling.
Most sampled signals are not simply stored and reconstructed. The fidelity of a theoretical reconstruction is a common measure of the effectiveness of sampling. That fidelity is reduced when contains frequency components whose cycle length (period) is less than 2 sample intervals (see Aliasing). The corresponding frequency limit, in cycles per second (hertz), is cycle/sample × samples/second = , known as the Nyquist frequency of the sampler. Therefore, is usually the output of a low-pass filter, functionally known as an anti-aliasing filter. Without an anti-aliasing filter, frequencies higher than the Nyquist frequency will influence the samples in a way that is misinterpreted by the interpolation process.C. E. Shannon, "Communication in the presence of noise", Proc. Institute of Radio Engineers, vol. 37, no.1, pp. 10–21, Jan. 1949. Reprint as classic paper in: Proc. IEEE, Vol. 86, No. 2, (Feb 1998)
Various types of distortion can occur, including:
Although the use of oversampling can completely eliminate aperture error and aliasing by shifting them out of the passband, this technique cannot be practically used above a few GHz, and may be prohibitively expensive at much lower frequencies. Furthermore, while oversampling can reduce quantization error and non-linearity, it cannot eliminate these entirely. Consequently, practical ADCs at audio frequencies typically do not exhibit aliasing, aperture error, and are not limited by quantization error. Instead, analog noise dominates. At RF and microwave frequencies where oversampling is impractical and filters are expensive, aperture error, quantization error and aliasing can be significant limitations.
Jitter, noise, and quantization are often analyzed by modeling them as random errors added to the sample values. Integration and zero-order hold effects can be analyzed as a form of . The non-linearities of either ADC or DAC are analyzed by replacing the ideal linear function mapping with a proposed nonlinear function.
When it is necessary to capture audio covering the entire 20–20,000 Hz range of auditory system such as when recording music or many types of acoustic events, audio waveforms are typically sampled at 44.1 kHz (CD), 48 kHz, 88.2 kHz, or 96 kHz. The approximately double-rate requirement is a consequence of the Nyquist theorem. Sampling rates higher than about 50 kHz to 60 kHz cannot supply more usable information for human listeners. Early professional audio equipment manufacturers chose sampling rates in the region of 40 to 50 kHz for this reason.
There has been an industry trend towards sampling rates well beyond the basic requirements: such as 96 kHz and even 192 kHz Even though ultrasonic frequencies are inaudible to humans, recording and mixing at higher sampling rates is effective in eliminating the distortion that can be caused by foldback aliasing. Conversely, ultrasonic sounds may interact with and modulate the audible part of the frequency spectrum (intermodulation distortion), degrading the fidelity. One advantage of higher sampling rates is that they can relax the low-pass filter design requirements for ADCs and DACs, but with modern oversampling delta-sigma-converters this advantage is less important.
The Audio Engineering Society recommends 48 kHz sampling rate for most applications but gives recognition to 44.1 kHz for CD and other consumer uses, 32 kHz for transmission-related applications, and 96 kHz for higher bandwidth or relaxed anti-aliasing filtering. Both Lavry Engineering and J. Robert Stuart state that the ideal sampling rate would be about 60 kHz, but since this is not a standard frequency, recommend 88.2 or 96 kHz for recording purposes.
A more complete list of common audio sample rates is:
! Sampling rate ! Use | |
5,512.5 Hz | Supported in Adobe Flash. |
>8,000 Hz | [[Telephone]] and encrypted [[walkie-talkie]], wireless intercom and wireless microphone transmission; adequate for human speech but without [[sibilance]] (''ess'' sounds like ''eff'' (, )). |
>11,025 Hz | One quarter the sampling rate of audio CDs; used for lower-quality PCM, MPEG audio and for audio analysis of subwoofer bandpasses. |
>16,000 Hz | [[Wideband]] frequency extension over standard [[telephone]] [[narrowband]] 8,000 Hz. Used in most modern [[VoIP]] and [[VVoIP]] communication products. |
>22,050 Hz | One half the sampling rate of audio CDs; used for lower-quality PCM and MPEG audio and for audio analysis of low frequency energy. Suitable for digitizing early 20th century audio formats such as 78s and [[AM Radio]]. |
>32,000 Hz | Digitales Satellitenradio]], [[NICAM]] digital audio, used alongside analogue television sound in some countries. High-quality digital wireless microphones. Suitable for digitizing [[FM radio]]. |
>37,800 Hz | CD-XA audio |
>44,055.9 Hz | Used by digital audio locked to [[NTSC]] ''color'' video signals (3 samples per line, 245 lines per field, 59.94 fields per second = 29.97 frames per second). |
>44,100 Hz | [[Audio CD]], also most commonly used with MPEG-1 audio ([[VCD]], [[SVCD]], MP3). Originally chosen by [[Sony]] because it could be recorded on modified video equipment running at either 25 frames per second (PAL) or 30 frame/s (using an NTSC ''monochrome'' video recorder) and cover the 20 kHz bandwidth thought necessary to match professional analog recording equipment of the time. A [[PCM adaptor]] would fit digital audio samples into the analog video channel of, for example, [[PAL]] video tapes using 3 samples per line, 588 lines per frame, 25 frames per second. |
>47,250 Hz | world's first commercial [[PCM]] sound recorder by [[Nippon Columbia]] (Denon) |
>48,000 Hz | The standard audio sampling rate used by professional digital video equipment such as tape recorders, video servers, vision mixers and so on. This rate was chosen because it could reconstruct frequencies up to 22 kHz and work with 29.97 frames per second NTSC video – as well as 25 frame/s, 30 frame/s and 24 frame/s systems. With 29.97 frame/s systems it is necessary to handle 1601.6 audio samples per frame delivering an integer number of audio samples only every fifth video frame. Also used for sound with consumer video formats like DV, [[digital TV]], [[DVD]], and films. The professional serial digital interface (SDI) and High-definition Serial Digital Interface (HD-SDI) used to connect broadcast television equipment together uses this audio sampling frequency. Most professional audio gear uses 48 kHz sampling, including [[mixing console]]s, and digital recording devices. |
>50,000 Hz | First commercial digital audio recorders from the late 70s from 3M and [[Soundstream]]. |
>50,400 Hz | Sampling rate used by the Mitsubishi X-80 digital audio recorder. |
>64,000 Hz | Uncommonly used, but supported by some hardware and software. |
>88,200 Hz | Sampling rate used by some professional recording equipment when the destination is CD (multiples of 44,100 Hz). Some pro audio gear uses (or is able to select) 88.2 kHz sampling, including mixers, EQs, compressors, reverb, crossovers, and recording devices. |
>96,000 Hz | [[DVD-Audio]], some [[LPCM]] DVD tracks, [[BD-ROM]] (Blu-ray Disc) audio tracks, [[HD DVD]] (High-Definition DVD) audio tracks. Some professional recording and production equipment is able to select 96 kHz sampling. This sampling frequency is twice the 48 kHz standard commonly used with audio on professional equipment. |
>176,400 Hz | Sampling rate used by [[HDCD]] recorders and other professional applications for CD production. Four times the frequency of 44.1 kHz. |
>192,000 Hz | [[DVD-Audio]], some [[LPCM]] DVD tracks, [[BD-ROM]] (Blu-ray Disc) audio tracks, and [[HD DVD]] (High-Definition DVD) audio tracks, High-Definition audio recording devices and audio editing software. This sampling frequency is four times the 48 kHz standard commonly used with audio on professional video equipment. |
>352,800 Hz | Digital eXtreme Definition, used for recording and editing Super Audio CDs, as 1-bit Direct Stream Digital (DSD) is not suited for editing. 8 times the frequency of 44.1 kHz. |
384,000 Hz | Maximum sample rate available in common software. |
>2,822,400 Hz | SACD, 1-bit delta-sigma modulation process known as Direct Stream Digital, co-developed by [[Sony]] and [[Philips]]. |
>5,644,800 Hz | Double-Rate DSD, 1-bit Direct Stream Digital at 2× the rate of the SACD. Used in some professional DSD recorders. |
>11,289,600 Hz | Quad-Rate DSD, 1-bit Direct Stream Digital at 4× the rate of the SACD. Used in some uncommon professional DSD recorders. |
>22,579,200 Hz | Octuple-Rate DSD, 1-bit Direct Stream Digital at 8× the rate of the SACD. Used in rare experimental DSD recorders. Also known as DSD512. |
>45,158,400 Hz | Sexdecuple-Rate DSD, 1-bit Direct Stream Digital at 16× the rate of the SACD. Used in rare experimental DSD recorders. Also known as DSD1024. |
High-definition television (HDTV) uses 720p (progressive), 1080i (interlaced), and 1080p (progressive, also known as Full-HD).
In digital video, the temporal sampling rate is defined as the rather the than the notional pixel clock. The image sampling frequency is the repetition rate of the sensor integration period. Since the integration period may be significantly shorter than the time between repetitions, the sampling frequency can be different from the inverse of the sample time:
Video digital-to-analog converters operate in the megahertz range (from ~3 MHz for low quality composite video scalers in early game consoles, to 250 MHz or more for the highest-resolution VGA output).
When analog video is converted to digital video, a different sampling process occurs, this time at the pixel frequency, corresponding to a spatial sampling rate along . A common pixel sampling rate is:
Spatial sampling in the other direction is determined by the spacing of scan lines in the raster. The sampling rates and resolutions in both spatial directions can be measured in units of lines per picture height.
Spatial aliasing of high-frequency luma or chrominance video components shows up as a moiré pattern.
Although complex-valued samples can be obtained as described above, they are also created by manipulating samples of a real-valued waveform. For instance, the equivalent baseband waveform can be created without explicitly computing , by processing the product sequence, , through a digital low-pass filter whose cutoff frequency is . Computing only every other sample of the output sequence reduces the sample rate commensurate with the reduced Nyquist rate. The result is half as many complex-valued samples as the original number of real samples. No information is lost, and the original waveform can be recovered, if necessary.
|
|